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Abstract — We consider two fundamental tasks in quantum 
information theory, data compression with quantum side in- 
formation as well as randomness extraction against quantum 
side information. We characterize these tasks for general sources 
using so-called one-shot entropies. These characterizations — in 
contrast to earlier results — enable us to derive tight second order 
asymptotics for these tasks in the i.i.d. limit. More generally, 
our derivation establishes a hierarchy of information quantities 
that can be used to investigate information theoretic tasks in 
the quantum domain: The one-shot entropies most accurately 
describe an operational quantity, yet they tend to be difficult to 
calculate for large systems. We show that they asymptotically 
agree (up to logarithmic terms) with entropies related to the 
quantum and classical information spectrum, which are easier 
to calculate in the i.i.d. limit. Our technique also naturally yields 
bounds on operational quantities for finite block lengths. 

Index Terms — source compression, randomness extraction, fi- 
nite block length, second order asymptotics, one-shot entropies, 
information spectrum 

I. Introduction 

THE characterization of information theoretic tasks that 
are repeated only once (the one-shot setting) or a finite 
number of times (the finite block length setting) has recently 
generated great interest in classical information theory Q], 

0. In particular, these studies investigate the asymptotic 
performance of information theoretic tasks in the second order, 

1. e., they determine precisely the contribution to the rate that 
is proportional to 1/y/n when we consider n independent and 
identically distributed (i.i.d.) repetitions of a task. Among the 
tasks that have been studied in this way are noiseless source 
coding Q, j4|, Slepian-Wolf coding |5), (6), random number 
generation when the source distribution is known [4|, the 
classical statistical evaluation used for parameter estimation 
in quantum cryptography [7], and channel coding [1|, [2|, (8). 

Concurrently, progress has been made towards character- 
izing tasks utilizing quantum resources in the same setting. 
Two different, but related [9|, techniques have been proposed 
to achieve this: one-shot entropies [ 10 1 and a quantum gener- 
alization 1111 . Ifl2ll of the information spectrum method |13|, 
IH41 . The one-shot approach provides bounds on operational 
quantities in terms of entropies for general sources. These can 
be computed for small examples, but are generally difficult 
to calculate even in the i.i.d. case. We relate these entropies 
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to the information spectrum of the source, which can be 
approximated in the i.i.d. setting flU to yield an asymptotic 
second order expansion. Combining the two techniques, we 
thus derive a second order expansion of operational quantities. 
This is the first such expansion in the quantum regime. 
We first give a brief overview of the two techniques. 

A. One-Shot Entropies 

Motivated by classical cryptography, Renner and coworkers 
generalized the min-entropy (i.e. the Renyi entropy lfl5l of 
order oo) to the quantum setting and used it to investigate 
randomness extraction against quantum adversaries lfl6l . (lOQ 
Together with a technique called smoothing, i.e. an optimiza- 
tion of the min-entropy over close states, this result implies a 
direct bound on randomness extraction against quantum side 
information, which is tight in the first order 1 10 1. Subsequently, 
the smooth entropy framework has been refined [ 17 1, |18|, |19] 
and used to characterize other tasks in quantum information 
theory, for example source coding GUI , state merging 1211 . 
and quantum channel coding |22|. 

Generally, these results consider operational quantities in 
the one-shot setting and provide direct and converse bounds 
on them that are valid for general sources and channels. In 
this work, given a source that emits a random variable, X, 
and (potentially quantum) side information, B, about X, the 
following two operational quantities are considered. 

• The maximal number of random and secret bits, e-close to 
uniform and independent of B, that can be extracted from 
X is denoted £ £ (X\B). This task was first investigated 
by Renner and Konig Ifl6l in the quantum setting and has 
various applications in cryptography. 

• The minimal length in bits of an encoding M of X, such 
that X can be recovered up to an error e from B and M 
is denoted m e (X\B). This is noiseless source compres- 
sion with side information and has been investigated by 
Devetak and Winter |23| in the quantum setting. 

Direct and converse bounds on these operational quantities 
are then given in terms of quantities we henceforth call 
one-shot entropies, of which Renner's smooth min-entropy, 
H^ in (X\B), is the most prominent example. These entropies 
exhibit useful monotonicity properties, similar to those of the 
Shannon entropy. Moreover, they can be evaluated numeri- 
cally for small examples and asymptotically converge to the 
conditional von Neumann entropy. 

'This is also known as the Leftover Hash Lemma in cryptography. 
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For the operational quantities mentioned above, their one- 
shot characterization in [10| and |20| directly implies that [19] 



t(X n \B n ) = nH{X\B) + 0{^fn) and 
m £ (X n \B n ) = nH{X\B) + O(Vn), 



(1) 



where < e < 1 and the operational quantities are evaluated 
for n i.i.d. copies of the source. This should be read as follows: 
independent of the allowed error e, the tasks can be achieved if 
and only if the rate is below the Shannon limit, H(X\B). Note 
that these asymptotic results can be proven more directly, as 
has been done in 11231 for source coding with side information. 
However, in addition to the asymptotic results, the one-shot 
approach often naturally yields bounds for finite block lengths, 
i.e., explicit expressions for the terms 0(y/n) for finite n. 



B. The Information Spectrum Method 

The information spectrum technique has been introduced 
by Han and Verdu lfl3l as a method to treat general in- 
formation sources beyond the i.i.d. scenario. Han succeeded 
in treating many major topics in information theory from a 
unified viewpoint [ 14 1 by describing the asymptotic optimal 
performance in terms of the asymptotic stochastic behavior of 
the logarithmic likelihood ratio and the logarithmic likelihood. 
Recently, the information spectrum method was employed to 
analyze the second order asymptotics of various tasks [4], most 
prominently channel coding (2). 

1) Classical Information Spectrum: Given a probability 
distribution, P(x), and a second, not necessarily normalized, 
distribution, Q(x), the logarithmic likelihood ratio is the ran- 
dom variable Z = log P(X) — log Q(X) under the probability 
distribution P. To investigate tasks in the i.i.d. limit in first 
order, it is often sufficient to investigate the expectation value 
of the likelihood ratio, which evaluates to the relative entropy, 
E[Z] = D(P\\Q) = £ a P(aO(logP(a:) - logQ(x)). 

However, going beyond i.i.d. sources, the information spec- 
trum method is concerned with the full spectrum of this 
random variable. In this work, we thus introduce the classical 
entropic information spectrum, 



D e s (P\\Q) = sup{R e M | P{Z <R}< e}. 



(2) 



(The relation of this quantity to a more traditional formulation 
of the information spectrum is discussed in Section fVHI| ) 

The following crucial observation highlights the usefulness 
of the classical information spectrum to derive a second order 
expansion. We consider n-fold i.i.d distributions P n (x) = 
Yli P( x i) an d Q n {%) — IL Q( x i)- Then, the classical entropic 
information spectrum evaluates to 

D £ s (P n \\Q n ) = sup ji? € K P™{ Zi< R] < 4> 

where Zi — log P(Xi) — \ogQ(Xi) and we are thus left with 
the spectrum of a sum of i.i.d. random variables. In this case, 
the law of large numbers implies that the distribution of 

£ ■ Zi- nfi ^ ^ = E ^ ^ ^ = Je[(Z- p) 2 ], 



converges to the normal distribution. More precisely, the 
Berry-Esseen theorem implies that 

Dl(P n \\Q n ) = nD(P\\Q) + VnsiPWQ)®- 1 ^) + 0(1), 

where a = s(P\\Q) is the standard deviation of the logarithmic 
likelihood ratio and $ is the cumulative normal distribution. 
The information spectrum thus has a natural second order 
expansion in the i.i.d. limit. (This derivation is presented in 
more detail in Section [YT]) 

2) Quantum Information Spectrum: A first quantum exten- 
sion of the information spectrum was investigated by Nagaoka 
and one of the present authors JTTJ, lfT2l in order to treat 
classical-quantum (CQ) channel coding and hypothesis testing 
for general sequences of channels and sources. In [12|, they 
also clarified the relation between CQ channel coding and 
quantum hypothesis testing (see also [ 24 1 , [25|, l26l . G71 for 
important contributions to quantum hypothesis testing). The 
quantum entropic information spectrum is denoted D e s (p\\a), 
and reduces to |2} if p and a commute. (We will define it in 
Section [V]) 

However, in contrast to the classical information spectrum, 
its quantum extension is difficult to calculate or approximate, 
even in the i.i.d. limit, due to the non-commutativity of p 
and a. A potential remedy was proposed by Hiai and Petz l26l 
in the context of hypothesis testing. They considered the joint 
measurement of a and £ a {p), where the latter is modified from 
p by pinching in the spectral decomposition of a. In |28|, 
these two density operators were then used to introduce an 
alternative quantum extension of the information spectrum, 
D e s (£ a (p)\\(j). Since the operators a and £ a (p) commute, this 
quantum version can be treated similar to the information 
spectrum of two classical distributions. Moreover, Hiai and 
Petz 06) showed that D{£ <jn (p n )\\a n ) = nD(p\\a) + o(n) 



for i.i.d. product states p n 



and er™ 



This 



analysis was generalized [28 1 to show that D £ s {£ a n.[p n )\\o' n ) = 
D(p\\<r) +o(n). However, a major drawback of this definition 
is that £ a n(p n ) is generally not i.i.d. even if p n and a n are 
i.i.d product states, which makes a second order evaluation of 
the information spectrum in the asymptotic limit difficult. 

Furthermore, in order to show the converse part of the 
quantum Chernoff bound in hypothesis testing, Nussbaum 
and Szkola ll29l introduced a pair of distributions related to 
p and a, which inherit the i.i.d. structure of p and a and 
have the convenient property that the first two moments of 
their likelihood ratio coincides with the first two moments 
of the likelihood ratio of p and a. Using the eigenvalue 
decompositions p = Y, x r A v x){v x \ and a = J2 V s y\ u v)( u vl 
these distributions are given by 

Pp,„{x, y) = r x \(v x \u y )\ 2 and Q p ^{x, y) = s y \(v x \u y )\ 2 , 

and their entropic information spectrum, D e s {P p (yi Qp,a)< will 
play an important role in our analysis. 

C. Main Results 

In this work, we improve the analysis leading to the asymp- 
totic expansion of the operational quantities in Eq. ([T]i, relying 
on techniques developed for the smooth entropy framework as 
well as the quantum information spectrum method. 
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1) Second Order Expansion of Operational Quantities: 
Our first contribution is to show that both the direct and 
converse bounds on the operational quantities converge to the 
same expression in the second order. In particular, we find 
the following asymptotic expansion for n i.i.d. copies of the 
source and < e < 1: 

t{X n \B n ) = nH{X\B) + % /^s(X|B)$- 1 (e 2 ) + O(logn), 
m £ {X n \B n ) = nH{X\B) - ^ s(XlB)^- 1 (e) + O(logra), 

where H(X\B) is the conditional von Neumann entropy of 
the source, <P is the cumulative normal distribution function, 
and s describes the standard deviation corresponding to the 
conditional von Neumann entropy. (See Corollaries [T6| and [T3] ) 
Note, in particular, that $ _1 (e) is negative for small e and 
changes sign when e exceeds 1/2. To the best of our knowl- 
edge, this constitutes the first second order expansion of an 
operational quantity involving quantum resources. 

2) Finite Block Length Analysis: Our analysis naturally 
yields both direct and converse bounds on the operational 
quantities for finite n, which can be evaluated numerically. 

We give an example of such a finite block length analysis 
in Figure [T] For this purpose, we consider the state that results 
when transmitting either |0) or |1) through the complementary 
channel of a Pauli channel with a phase error p = 0.05 that is 
independent of the bit flip error. The resulting state is 

Pxb = ||0><0] ® + |]1X1| ® I Wl, where 

We are interested in the rate r — -£ £ (X n \B n ) at which 
uniform and independent randomness can be extracted from 
X if we require that e = 10 -6 . In the first order, the rate 
approaches H(X\B) = 1 — h(p) s» 0.714. The deviation 
from this bound in the second order is significant for small 
n. We have s(X\B)^~ 1 (e 2 ) ~ —9.6 for this example, which 
leads to a drop of 10% in the rate at n rs 1.8 • 10 4 . Converse 
bounds for finite n are relevant as they allow us to investigate 
how close the performance of a given protocol is to the 
maximal achievable rate. From Fig. [T] we can deduce that it is 
impossible to securely extract more then 95% of the Shannon 
entropy for n = 10 4 . The calculations leading to the direct 
and converse bounds on £ £ (X n \B n ) p for this example are 
discussed in Appendix |A| 

3) Tight One-Shot Characterization: In order to prove our 
results, we first find a suitable characterizations for £ £ and 
m £ in terms of one-shot entropies. We bound t £ in terms of 
the smooth min-entropy, H^ in (X\B) of the source, and use 
a conditional version of the hypothesis testing entropy |30|, 
Hfi(X\B), to bound m £ . This results in the following bounds, 
for any < r\ < e < 1 (cf. Theorem [8] and [9]): 

K^(X\B) - log 1 - 3 < £ £ (X\B) < H £ nia (X\B), 

H £ h (X\B) < m £ {X\B) < H £ -\X\B) + log ± + 3. (3) 

Here, s is a parameter of the problem and is kept constant, 
whereas rj can be optimized over. 




Fig. I. This plot shows direct and converse bounds on the extraction rate 
for n £ [10 4 , 10 s ] . For increasing n the bounds first converge to each other, 
then to the second order asymptotics, and finally to the Shannon rate. 

These one-shot characterizations are tighter than earlier 
results in ifTOl . Il20l . In particular, even if we choose rj = 
poly(n) -1 , the additive terms grow at most as logn for large 
enough n. The smoothing parameter of the one-shot entropies 
(in both bounds) thus inherits the operational interpretation 
of £ as the allowed error. This is of crucial importance as the 
second order expansion we aim to show is a function of e, 
and we thus need to find lower and upper bounds in terms of 
the same parameter. 

4) Hierarchy of Information Quantities: Furthermore, we 
establish a hierarchy of information quantities (cf. Figure [2]) 
that can be used to analyze quantum information tasks beyond 
the examples discussed above. The hierarchy is partly inspired 
by recent results in hypothesis testing and constitutes the main 
technical contribution of this paper. 

The operational quantities (Class 1) describe the optimal 
achievable performance of a task, and they depend strongly 
on the exact specification of the considered task. For example, 
in the case of randomness extraction, this quantity depends on 
the precise security requirement we impose on the extracted 
random variable. To calculate these quantities, one needs to 
optimize the performance over all valid protocols, which is 
difficult even for small examples. In a first step, we thus 
bound the operational quantities in terms of one-shot entropies 
(Class 2), in our case and H^. These quantities can 

be formulated as semi-definite optimization problems (SDPs^ 
and can therefore be calculated for small examples. 

In the i.i.d. setting for large block lengths (e.g., n 3> 10), 
however, these optimization problems quickly become in- 
tractable as their complexity scales exponentially in n. Thus, 
we relate the one-shot entropies to the quantum information 
spectrum (Class 3) and then the classical information spectrum 
of the corresponding Nussbaum-Szkola distributions (Class 4), 
which can often be approximated even for large n. This can 
be done incurring an additive error term that scales at most 
logarithmically in n. Finally, the classical entropic information 

2 The min-entropy can be formulated as an SDP 1311 and an extension of 
this to the smooth min-entropy is possible. The SDP for hypothesis testing is 
discussed in Section fill 
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Class 


Role 


Quantities 


Class 1 


Describes the optimal achievable performance. 

The calculation is very difficult, even for small examples. 


m £ (X\B) 
l e {X\B), 


Class 2 


Bound for general sources in terms of one-shot entropies. 
The calculation is possible for small examples, using an SDR 


Hf ±r >{X\B), H^(X\B), 
D £ h ±v ( P h), D^MW) 


Class 3 


Quantum version of the entropic information spectrum. 


D £±d {p\\a), Dl ±d {£ a {p)\\(j) 


Class 4 


Classical entropic information spectrum. 

The calculation is approximately possible for i.i.d. sources. 


D^^PpAQp.a) 


Class 5 


Asymptotic expansion in the second order. 
The calculation is easy for arbitrarily large n. 


nD{p\\o) + ,Jna{ P \\<T)$- L {e) 
nH(X\B) + y /ris(X\B)^- 1 (e) 



Classes 


Difference 


Method 


Class 1 -> Class 2 


O(logn) with r\ oc -4= 


One-shot analysis, random coding and monotonicity (cf. Thms. 


9 and 8 I. 


Class 2 Class 4 


O(logn) with 5 oc 


Relations between relative entropies (cf. Prop. 13 and Thm. 14 




Class 4 -> Class 5 


0(1) 


Berry-Esseen Theorem (cf. Eq. (30|>). 



Fig. 2. Hierarchy of information quantities. We consider operational quantities for a constant e £ (0, 1) and n i.i.d. uses of the source. The one-shot entropies 
(Class 2) provide a "microscopic" analysis of the optimal performance of a task (Class 1) for general sources, whereas the information spectrum (Classes 3 
and 4) and their asymptotic expansion (Class 5) give a "macroscopic" view that can be approximately calculated for sources with sufficient structure. (Note 
that Class 3 and Class 4 are unified if p and a commute.) 



spectrum allows us to evaluate the second order asymptotic 
expansion precisely (Class 5). 

Formally, we show that the following quantities are equiv- 



alent in an appropriate sense (cf. Proposition 13 and Theo- 
rem [14): 

D^(pW) « DUpW) « Dl(p\\a) 

ttDt(e a (p)\W)*D s s (p P! 4Q P}tT ). 

First, note that e is varied by an additive optimization pa- 
rameter in some relations, analogously to the situation in 
Eq. More importantly, the equivalence only holds up to 
additive terms log#(er), where 9(a) is the logarithm of the 
ratio between the largest and smallest eigenvalue of a. In the 
i.i.d. setting, it is evident that 9(a n ) grows at most linearly in n 
and this additive term thus grows at most as O(logn). Hence, 
our results imply that the smoothing parameter for all these 
quantities can be chosen as e ± poly(n) -1 without incurring 
a penalty that grows faster than O(logn). 

5) Convergence to Relative Entropy: Our analysis also 
improves on earlier work iflOl . Qj |, 1 32 1, 1 19 1 that investigated 
the convergence of one-shot entropies in the i.d.d. setting for 
finite n. Given a quantum state p and an a positive semi- 
definite operator er, these earlier results imply that the i.i.d. 
product states p n and a n satisfy 

D s max (p n \W n ) = nD(p\\a) + O(Vn), and 
D%(p n \\a n ) = nD(p\\a)+0(V^), 

where Df is the hypothesis testing entropy and -D^ lax a relative 
entropy version of the smooth min-entropy. These results also 
establish explicit upper and lower bounds on the convergence 
for finite n, however, the second order term, scaling as y/n, is 
not tight. Our analysis implies improved bounds for finite n, 
which are tight in the second order. We find 

D s max (p n h n ) = nD(p\\a) - s{p\\o)*- x (e 2 ) + O(logn), 
D e h {p n h n ) = nD(p\\(T) + ^ s (p|| ( r)$- 1 (e) + O(logn). 



(These statements are shown in Section [VI- A| ) 

We also point the reader to independent and concurrent work 
by Li (33), who also recovers the i.i.d. second order asymp- 
totics for hypothesis testing using a more direct approach. 

D. Outline 

The remainder of the paper is structured as follows. In Sec- 
tion [II] we introduce the notation and the one-shot entropies 
required for our discussion. We show some properties of these 
one-shot entropies, which we will need for subsequent proofs. 
In Sections [HI] and [IV] we characterize source compression 
and randomness extraction with quantum side information in 
the one-shot setting, respectively. Section [V] then introduces 
relations between different information quantities which are 
used extensively to derive our asymptotic results. Section [VI] 
is devoted to the analysis of the two operational tasks in the 
i.i.d. asymptotic setting and Section VII covers finite block 



lengths. Finally, Section VIII illuminates the relation between 



the traditional formulation of the information spectrum method 
and its entropic version used in this work. 

II. Preliminaries 
A. Notation and Definitions 

For a finite-dimensional Hilbert space "K, we denote by 
£(JC) and V(Ji) linear and positive semi-definite operators 
on JC, respectively. Quantum states are in the set 6>(!K) = 
{p £ V(!K) : tr(p) = 1} and we also define the set of sub- 
normalized quantum states, <5><(5t) = {p e Vi^rC) : < 
tr(/o) < 1}- On £(!K), we employ the Schatten oo-norm || • ||, 
which evaluates to the largest singular value, and the Hilbert- 
Schmidt inner product (L, M) = tr(L^M). 

We write A > B if and only if A - B € V(0i). When 
comparing a scalar to a matrix, we implicitly assume that it is 
multiplied by the identity matrix, which we denote by 1, We 
use {A > B} to denote the projector onto the space spanned 
by the eigenvectors of A — B that corresponds to non-negative 
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eigenvalues. Clearly, this definition implies that {A > B}A > 
{A > B}B. Moreover, we have {A > B} = 1 - {A < B}. 

Multipartite quantum systems are described by tensor prod- 
uct spaces. We use capital letters to denote the different sys- 
tems and subscripts to indicate on what subspace an operator 
acts. For example, if Lab is an operator on 3-C ab — "Ka^^-b, 
then La = ^b{Lab) is implicitly defined as its marginal 
on A. Conversely, we call Lab an extension of La- We call 
a state classical-quantum (CQ) if it is of the form pxA = 



Y.xPx\x){x\ 



where cf> A G S(^Ka), Px a probability 



distribution, and {\x)} an orthonormal basis of 3~Cx- We call 
X a register to distinguish it from genuinely quantum systems. 

A map £ : C(CK) —> C(W) is called a completely positive 
map (CPM) if it maps P(!K ® JC") into V{VL' <g> JC") for any 
JC". It is called trace-preserving (TP) if tr(£ (X)) = tr(X) for 
any X £ V(Ji). It is called unital if £(1) = 1 and sub-unital 
if £(1) < 1. The adjoint map £^ of a map £ is defined via the 
relation (£(L), M) = (L, £ f (M)) for all L, M. Adjoint maps 
of CPMs are CPMs. Moreover, adjoint maps of TP-CPMs 
are unital CPMs and vice versa. As an example, consider the 
pinching TP-CPM M X - L i-» J2 X \ X )( X \ L \ X )( X \ and note that 
this map is self-adjoint with regards to the Hilbert Schmidt 
inner product and, thus, also unital. 

The following result generalizes pinching. 

Lemma 1: Let A e V{0i) and let £ : £(J£) -> £(Jf') be a 
sub-unital CPM. Then, \\£(A)\\ < \\A\\. 

Proof: We may write the Schatten infinity norm (of 
positive semi-definitive operators) as an SDP 



max tr (AO) 

tr(Q)<l 



mm 7 . 

A<jl 



Thus, if 7* is minimal for ||A||, we have £(A) < J*£(l) < 7*. 
Hence, 7* is feasible for ||£(A)||, concluding the proof. ■ 

B. The Smooth Entropy Framework 

We use the purified distance lfl8l on sub-normalized quan- 
tum states and write p sa £ a if and only if P(p, a) < e, where 
P(p,a) := y/l-F*(p,a) and 



F(p, a) :=tr| y/py/j\ + y/(l - tr p)(l - tr a) . 

generalizes the fidelity to sub-normalized states. The purified 
distance has the following properties |18|. 

• Uhlmann's theorem: Let pab G S<(^Kab) be a bipartite 
state and ta € S<(3~Ca) with ta ~ £ Pa- Then, there 
exists an extension tab of ta such that tab ~ £ Pab- 

• Monotonicity: Let £ be a trace non-increasing CPM. 
Then, p « e r £{p) w e £(r). 

. Fuchs-van de Graaf: D(p,r) < P(p,r) < ^2D{p,r), 
where D(p,r) — § tr \p— r| + || tr p — tr r| < tr |p— r|. 

For any sub-normalized state p G 6><(!H), we use B £ (p) := 
{p G 5<(5t) | p ~ £ p} to denote the set of states close to p. 

We use the following relative and conditional entropies B4l . 
[18 1, which are based on Renner's initial definition of the 
smooth min-entropy [10|. 

Definition 1: Let p G 5(J£), a G "P(Jt), and < e < 1. 
Then, the relative max-entropy is defined as 

D e max (p\W) := min inf{A G R | p < 2 A a} . 



where we optimize over all p G S< (Jf ) with p « £ p. 

Definition 2: Let p^s G and < e < 1. Then, 

the smooth min-entropy is defined as 

where the optimization is over all as G S(3~Cb)- 

We employ the following property of this entropy lfT8l . 

Lemma 2: Let pab G 5<(5f^s) and let U : A — > A' be 
an isometry. Then, i^ in (A[.B) p = H^ m {A'\B) UpU i . 

Moreover, the smooth min-entropy is monotonous under the 
application of classical functionsPl 

Proposition 3: Let pxab = 2~2 X P X \ X )( X \ ® 'Pab be a state 
in 5<(IKxas) and let / : X — > Z be a function. Then, 



H^- m (XA\B) p > H^ in (f(X)A\B) p , where 

E P*<t>AB 
xdf-Hz) 



PZAB 



Proof: We first show the statement for the trivial function 
f(x) = 1, i.e. we show that H^ in (XA\B) p > H^ in (A\B) p . 

For a state p^s ~ e Pab that maximizes _ff I '5 lin ( J 4|_B)p, we 
define an extension pxab G S<("Kxab) with px^is ~ e 
Pxab using Uhlmann's theorem for the purified distance. Fur- 
thermore, using the pinching Mx ■ L i-> ^ x |a;)(x|L|x)(a;|, 
we define 

Pxab ■= Mx(pxab) ~ € Mx(pxab) = Pxab- 

Now, note that pxab has the form pxab — 2~2xPx\ x )( x \ ® 
^ B and pas = Y.xP'^ab < 2" A U (g) cr s for A = 
i?^j n (A|B) p and some ctb G S(^Kb) by definition. Moreover, 



Vx:p x <l> x AB <2~ A 1 



XA 



0~B 



and we thus have H^ in (XA\B) p > A, concluding the proof 
for this special case. 

For general functions /, we first apply the isometry Ut : 
\x) 1 — ^ |a;) ® \f(x)) from X to ATZ. We then define the state 



txzab = UjpxabUJ- = y^ j p x S z j( x )\i 



,j ab > 



which is an extension of both pxab and pzab- The statement 
now follows from the relation 

Hf nin (XA\B) p = H^ n {XZA\B) T > H^- m (ZA\B) T , 

where the equality is due to Lemma [2] and the inequality is 
covered by the special case already shown. ■ 

C. The Hypothesis Testing Entropy 

Another one-shot entropy has been used in hypothesis 
testing and channel coding ll30l . 

Definition 3: Let p G 5(JC), a G V(3C) and < e < 1. 
Then, the e-hypothesis testing relative entropy is defined as 

2 -dUp\W) : =mi{(Q,a)\0< Q < 1 A (Q,p) > 1 - e}. 

It is easy to verify that the infimum is always attained and 
D e h (p\\a) takes values in [0, 00] when we supplement the real 
axis by 00 and define — log = 00 for the binary entropy log. 

3 See also [19 |. Renner 1 1 1 showed this property for e = 0. 
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We will use conditional versions of this entropy, defined as 
follows. 

Definition 4: Let p A s G S{^K ab ), ®b € S^Kb) and 
< e < 1. The conditional e-hypothesis testing entropies 
are defined as H^(A\B) p \ a . := -D e h (p AB \\ 1a ® ctb) and 
Hl(A\B) p :=m3x a H* h (A\B) pW . 

The following two expressions for 2 H ^ A \ B }f> are equivalent 
due to the strong duality of semi-definite programs [35|. 



Qa B and dual optimal operators of the form Nxayb = 



primal problem 
min: ||Q B || 
s.t.: < Qab < 1 



dual problem 

max: rj{\ — e — tr(iV^ B )) 

s.t: 7] > 0,N AB > 0,cr B > 
1. 



{Qab, Pab) > 1 - e pab < -^a® (?b + N A b 

-q 

tx(a B ) < 1 

Accordingly, we call an operator Qab that satisfies < 
Qab < 1 an d {Qab, Pab) > 1 — £ primal feasible. If it 
also attains the minimum, we call it primal optimal. Similarly, 
a triple {Nab,°~b^t} of positive semi-definite operators is 
called dual feasible if it satisfies pab < ^A <8 cb + Nab 
and tr(as) < 1. It is called dual optimal if it also attains the 
maximum. 

We now explore some properties of the hypothesis testing 
entropy. It satisfies a data-processing inequality. 

Proposition 4: Let pab G S(^Kab), let £ : A —> A' be a 
sub-unital TP-CPM, and let T : B ->• B' be a TP-CPM. Then, 
ta'b> = £ ° ^(pab) satisfies Hg(A|S) p < ^(A'|B') T . 

Proof: Let {A^s, ctb,v} be dual optimal for i3^(j4|.B)p. 
Then, applying £ o T on both sides of the inequality p^s < 
^1,4 (g) ctb + N AB leads to 

tab < -£(1a)®F{<tb)+£°F{N A b) < -1a<®&b>+Na'b>, 

where Na'B' — £ ° J~(Nab) and erg' = J^ctb)- Hence, 
the triple { N A > B > , a B , , r/} is dual feasible and H^(A'\B') T > 

7j(l-e-tr(iW)) -^(^I b )p- ■ 
The following result implies that structure of the state can 
be translated into structure of the optimal primal and dual. 

Lemma 5: Let pab € S(3{ AB ) and let £ : A — > A be 
a unital TP-CPM and J" : B ->■ B be a TP-CPM such that 
£ o J- (pab) — Pab- Then, the following holds for the SDP 
for Hf l (A\B) p : (1) If Q'ab is primal optimal then Qab = 
£^ ° J 7 ' (Q'ab) * s Primal optimal too. (2) If {N' AB , a' B , rf\ is 
dual optimal, then {Na B , o\b, t/} with N AB = £ ° F{N'ab) 
and as = F{v' B ) is dual optimal too. 

Proof: Let Q'a B be primal optimal. The operator Qab 
as defined above is feasible since the inner product satisfies 
(&°J^{Q' ab ),Pab) = (Qab>£ o:f (pab)) and Q AB < 1 AB 
since the maps £t and are unital. Moreover, ||^(Qs)|| < 
\\Q B \\ due to Lemma [T[ which implies that Qab must be 
optimal. Similarly, let {N' AB , a' B , ij} be dual optimal, then 
{N AB , a B , r]} as defined above is clearly dual feasible and 
optimal. ■ 
Corollary 6: Let Pxayb & S(3-Cxayb) be classical on 
X and Y. Then, the SDP for H e h {AX\BY) p has a primal 
optimal operator of the form Qxayb — J2 X y \ x )( x \ ' 



E x , y \ x )( x \ <8> \y)(y\ ® and ay S = £ y 

The following proposition gives bounds on the entropy of 
classical information. 

Proposition 7: Let pxab = ^2 x Px \ x )( x \ ®4> AB be a state 
in S(3{xab)- Then, using |X| = |supp{p x }|, 

H s h (A\B) p > Ht(A\XB) p > H s h (XA\B) p - log \X\, (4) 
H%{A\XB) p < Hi{XA\B) p < H e h (A\B) p + log \X\ . (5) 

Proof: We prove the four inequalities separately and apply 
Corollary [6] which ensures that the primal optimal operators 
are classical on X. 

The first inequality in Q follows from Proposition |4] using 
the partial trace over A as a TP-CPM. 

To prove the second inequality in Q, let Qxab = 
J2x- P >o \ x )( x \®Qab be primal optimal for H^(A\XB) p = 
log maxj ||Qs ||' Thus, Qxab is primal feasible for 
Hl(XA\B) p and we find 



H e h {XA\B) p < log \\ti xa{Qxab)\\ = log 



< 



log ( I suppfe } I max 1 1 Q X B 

V ' x 



x: p x >0 



which implies the result. 

The first inequality of |5]l follows in a similar fashion, 
only this time we choose Qxab to be primal optimal for 
H e h (XA\B) p . Clearly, 



H e h (A\XB) p < logmax||g B || < log 

X 

= Hf l (XA\B) p . 



x: p x >0 



Finally, to prove the second inequality in Q, we consider 
the SDP for H e h (A\B) p = log||Q B || with Qab primal 



Q 



AB 



optimal. Then, the operator Qxab = J2x-. Pn: >o 
is primal feasible for Hf l (XA\B) p . In particular, it satisfies 

^(QxabPxab) = E Px tT (QAB <j> AB ) 

x:px>0 

= ^{QabPab) > 1 - £, 

where the last inequality follows from the primal feasibility of 
Qab- Hence, we have H e h (XA\B) p <\og\\tv X a{Qxab)\\ = 
log | supp{p K }| + log \\Qb II , which concludes the proof. ■ 

III. One-Shot Characterization of Randomness 
Extraction 

Given a state pxb = Tl x Px\x)(x\ <g> cf> B that is classical on 
X, we want to extract a random string, Z, that is independent 
of the quantum side information B. We consider randomized 
protocols V = {S, 2,p s , h s } that consists of a seed, S, an 
output set Z, and for all s s S, a probability p s and a hash 
function h s : X —> Z. 

The protocol now acts on the initial state pxbs = Pxb <8> 
J2 s Ps\ s )( s \ by applying the function h s depending on the 
value s in the register S. This results in tzbs = J2 s Ps\ s )( s \® 
t s zb , where r| B = J2 Z \ Z )( Z \ ® Hx&hj'^P^B- Alterna- 
tively, this evaluation can be modeled using a TP-CPM JF 
from XS to ZS, such that r^ss = J-(pxbs)- 
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We characterize the extractable randomness of a state pxs 
by the amount of randomness (in bits) that can be extracted 
such that Z is close to uniform and independent of E and S. 

Definition 5: Let pxs G S(!Kxb) be a CQ state. Then, we 
define the maximal extractable randomness as 

£ e (X\B) p := max{£e N\3V: \og\Z\ > £ A d scc < s}, 

where V is any protocol defined above and d sec (V, Pxb) '■= 
m\n aB P(tzbs> ^z ® cr B ® T s) evaluated for the final state 
t~zbs obtained using V and ttz = lz/\Z\ fully mixed. 

Note that the independence of the resulting randomness is 
often quantified using the trace distance instead of the purified 
distance. In particular, universal composability requires that 

dcompiV, pxb) ■= ^tr \t X bs - ® t B s\ < 2d scc (V, pxs) 

is small. However, we are only able to derive tight second or- 
der asymptotic s for the relaxed requirement used to define £ e . 

We show that the operational quantity £ £ can be character- 
ized by the smooth min-entropy in the following sense. 

Theorem 8: Let pxb £ S(!Kxb) be a CQ state and < 
r\ < e < 1, Then, 

H e ^(X\B) p - log i - 3 < t e {X\B) p < H^ in (X\B) p . 

We prove the direct part (left-hand inequality) and converse 
part (right-hand inequality) separately. 

A. Proof of Converse 

The converse employes the monotonicity of the smooth min- 
entropy under classical functions and is adapted from fl9l . 

Proof of Converse of Theorem^ We prove the statement 
by contradiction, i.e. we show that for all protocols V, 

log \Z\ > H^ in (X\B) p d scc (V,p XB ) >e, (6) 

which implies the converse. 

For this purpose, let V be any such protocol with log \ Z\> 
H^ in {X\B) p . Hence, for any s G S, we have H^ in (Z\B) T , < 
log \Z\ due to Proposition [3] By the definition of the smooth 
min-entropy, this implies that irz O <jb 4- ^ 6 { t zb) anc ^ thus, 
F(t zb ,tt z <8 (Jb) < vi — £ 2 for an Y gb- Hence, 

maxF(T Z BS,irz <S> &b <E> ts) 

a 

= maxY^ p s F(t zb ,7t z ® cr B ) < \J I - e 2 , 

a * — » 

which shows the implication |6) and concludes the proof. ■ 

B. Proof of Achievability 

We employ a result for two-universal hashing in [10]. 
Proof of Direct Part of Theorem [Sj- We employ Corol- 
lary 5.5.2 in ifTOl . which states that, for any protocol V using 
two-universal hashing, and any CQ state pxs G <5<(?Cxb)> 
the state tzbs = J~(pxb) satisfies 

tr \f ZBS -TT Z (g>r BS \< ^\Z\2- H ^( x \ B 'h . (7) 



Now, let pxs G B £ -i(p XB ) be such that H^(X\B) p = 
H min (X\B) p . Then, 

dseciV, pxb) < P{t~zbs, ^z ® t e <E> t s ) 

< P(tzbs, nz®T E ® t s ) + P{f Z BS, tzbs) 

< v^tr \ f Z BS - k z ® te ® t s \ + e - rj 

<y/2(\Z\2- H -**( x W>)*+e-r). 

Here, we used the Fuchs-van de Graaf inequality, Eq. and 
the fact that P{t Z bs,tzbs) < P{pxb,Pxb) < £ - r?. Thus > 
the condition d scc (V , px b ) < £ is satisfied when 

y/2(\Z\2- H '^ x W>)* < V 
and we find that the choice 

£ = log \Z\ = [H^(X\B) p + logr? 4 - 2j 
achieves the required bound. ■ 

IV. One- Shot Characterization of Source 
Compression 

Given a state pxb = Tl x Px\x){x\ (g> (f> B that is classical on 
X, we are interested in compressing X into a register M such 
that M and the quantum system B allow to recover X with 
an error at most e, where < e < 1 is a fixed constant. We 
consider a general class of randomized compression protocols. 

A protocol V = |<S, M, {p s }, {e s }, {D s m }} consists of a 
seed, S, a code book, M., and, for all s G S, a probability 
p s and an encoder function e s : X — > M. Furthermore, for 
every s E S and every m G A4, we have a decoder POVM 
^s.m = {^m ,m }a; that measures X' on B. The protocol can 
be split into an encoding and decoding part: 

• The encoding protocol acts on the initial state pxsss' = 
Pxb <8> J2 s Ps\ ss )( ss \ss> with the isometry U e = 
J2 x ,s I^X^U ®|e a (a;)) m®I s X s Is- Informally, this means 
that, depending on the value s G S in the register 5, the 
function e s is applied on X and the result is stored in 
M, while a copy of X is kept. This operation results in 

the state t X mbss> ■= U e p X BSS'Ul- 

• The decoding protocol acts on the systems S', M and B 
to extract X' . We describe this operation using a TPCPM 
T>, which reads out s G S from S' and m £ M from 
M and then applies the measurement T> SMl on _B. This 
is given by the TP-CPM V from MBS' to AT'S', 

: txmbss' !->• X! NX^U' ® |sX s k ® 

tr M BS'((|s><s|s" ® |m)(m| M ® M|' m )r X MBSS')- 

We denote the final state by uxx'ss' : = T)(txmbsS')- 
Using this class of protocols, we characterize the compress- 
ibility of a state pxb by the minimal compression length (in 
bits), m £ , required to achieve an error probability of at most e. 

Definition 6: Let pxb G S(3~Cxb) be a CQ state. Then, we 
define the minimal compression length as 

m £ {X\B) p := mm{m G N | 3V: log \M\ < m A p GIT < e}, 



JOURNAL OF BTpX CLASS FILES, VOL. 6, NO. 1, JANUARY 2007 



8 



where V is any protocol defined above and p crr ('P, Pxb) 
E s <-p„(Pr[X ^ X'\S = s]) evaluated for the state ljxx'SS' 
as defined above0 

We show that the operational quantity m e can be character- 
ized by the hypothesis testing entropy in the following sense. 

Theorem 9: Let pxb € S{"Kxb) be a CQ state and < 
rj < e < 1. Then, 



= V s , m = {M*> m } x , where 



log ^ 



We prove the direct part (right-hand inequality) and con- 
verse part (left-hand inequality) separately. 

A. Proof of Converse 

The proof of the converse utilizes various monotonicity 
properties of the hypothesis testing entropy. 

Proof of Converse of Theorem [9} It is sufficient to show 
that, for every protocol V as defined above, the requirement 

p C rr{V,p XB ) < e implies H%(X\B) P < \og\M\. 

Let V be any fixed protocol with p exI (V, Pxb) < £ and 
let pxbss', txmbss' and ujxx'SS' be defined as above. We 
employ the projector Qxx'SS' = Ei. s \xx)(xx\®\ss)(ss\ and 
note that tr(Q X x'SS>uxx>ss>) = 22 s P s Pr[X = X'\S = 
s] > 1 — £ due to the requirement on p c „(P, Pxb)- Thus, 
Qxx'SS' is primal feasible for H^(XS\X'S') U and we have 

= log HQx'S'II 

> H E h (XS\X'S% 

> H e h {XS\MBS') T (cf. Prop. |) 

> H s h (XMS\BS') T - log \M\ (cf. Prop. ^ 
= fr^(X5|S5')p - log |A*| (cf. Prop. § 
= ^(X|fl) p -log|M|. 

The final equality follows from the following observation. 
Let Qab and {Nab, o~b, v} be primal and dual optimal for 
Hf l (X\B) p , respectively. Then, it is easy to verify that Qab® 
Yj S \ss)(ss\ is primal feasible and {Nab ® Pss' ,<J B ® ps,v} 
is dual feasible for Hf i (XS\BS') p , which implies equality. ■ 

B. Proof of Achievability 

The proof is heavily based on [11 1 (see also QUI . GUI ); in 
particular, we need the following result IfTTI : 

Lemma 10: For any c>0, 0<S<1 and T > 0, we have 
l-(S + T)-iS(S + T)-i < (l + c)(l-S) + (2 + c+A)T. 

The proof now employs two-universal hashing in the en- 
coder as well as pretty good measurements in the decoder. 

Proof of Direct Part of Theorem |9f We propose the 
following protocol. The encoder creates M from X through 
two-universal hashing [36 1, i.e. we consider a family of 
encoders and a seed satisfying Pr s <_ P8 [e s (x) — e s (z)] < 
if x 7^ z. Let Qxb = J2x \ x )( x \ ® Qb be primal opti- 
mal for H e h ~ r ' \X\B) p \ p . Then, we use the decoding POVMs 

4 Note that this definition implies that the optimal protocol for a fixed state 
Pxb is deterministic since we can always fix the seed to the value that 
achieves the lowest error probability in the average. 




^ 2ge s 1 (m) 

It remains to bound the average error of this protocol on 
the state pxb- To do this, first note that p crr (P, Pxb) 
E.<- P .,»<-p. [tr(p|(lfl - M s x ' eAx) ))}, where 1 B - M s x ' e < 



(x) 



as 





can be upper-bounded using Lemma [TO 

_i 

Qb i 

< (l+c)(l B - Q%) + (2 + c+-)Y / S eAz)=es(x) Q B 

Z^X 

for any c > (to be optimized over below). We now substitute 
this to bound Pe T r(P, Pxb), i-e. 

Perr(P,PXB)< (1+c) E [tr(^(l B - Q B ))] 
x-h-p x 

+ (2+c+-) E \y2S es{z)=esix) tr{p B Q B ) 

Z^X 

The second expectation value can be simplified using the two- 



universal property: E s <-p„ [S, 



e s (z)—e s (x)_ 



< 



\M\ 



if x ^ z. We 



can thus upper bound the whole expression by 

Peix('P,PXB) < (1 + c)tr(psx(lxB ~ Qxb)) 

3 i ]•(//;. > ]q%)] 



z^x 



< (1 + c)(e - i]) + 



c\M\ 



2 H' h -^(x\B) p _ 



Here, we used that tr(p B xQxB)) > 1 — (s — v) and 
tftrW"**' = tv(p B Q B ) > Efxtr(p|^Ql 



Z^X 



Hence, this protocol will lead to an error of at most e if 

" £ ^(X\B) plp -' ' 2 " 



we choose m = log \A4\ 
The choice c — — ^ 



then leads to the desired bound. 



V. Hierarchy of Relative Entropies 

We will use the following properties of the relative en- 
tropies, which can be verified by a close inspection of their 
respective definitions. 

. Monotonicity: For any TP-CPM £ , 

x = h,max: Dl(p\\a) > D%(£ (p)\\£(a)). (8) 

• When a < a', we find 

x = h,m&x: Dl{p\\a)>D e x {p\\a'). (9) 

Furthermore, if a and a' commute, this extends to x = s. 

• When a' = 2~ A er, we further have 

x = h, max, s : D x (p\\a) = D x (p\\a) + A. (10) 

Furthermore, Lemma 9 of [28 1 is of pivotal for our analysis. 

Lemma 11: For any p,a G V{'K), we have p < v(a)£ a (p), 
where v(a) denotes the number of different eigenvalues of 
a and £ a is a pinching that projects on the eigenspaces 
corresponding to the different eigenvalues of a. 
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A. The Information Spectrum 

We introduce the following quantity, which is as an entropic 
version of the quantum information spectrum |12|. (The rela- 
tion of this quantity to the more traditional formulation of the 



information spectrum is explained in Section |VIII ) 

Definition 7: Let p G 5<(JC), a G V(fK), and < e < 1. 
Then, the information spectrum relative entropy is defined as 

D £ 3 (p\\a) := sup{R G K | tr p{p < 2 R a} < e} . 



If p and a commute, we may expand them in a common 
orthonormal eigenbasis, e.g. p = 2~2 y r y\ u y)( u y\ an d a = 
2~2 y s v\ u v)( u y\- Consider now the distributions P(y) = r y and 
Q(y) = Sy , we find that tr p{p < e R a} = P{logP-logQ < 
R}, and recover the classical information spectrum D^(P\\Q) 
as defined in Q. 

The information spectrum is intimately related to hypothesis 
testing, as has been pointed out in B12L Here, we present a 
proof in the one-shot setting for the convenience of the reader. 

Lemma 12: Let p G S(3i), a G V(0i), and 8 > 0. Then, 

Dl(p\\a) < Dl(p\\a) < D s + S (p\\a) - log 5. (11) 

Proof: The first inequality follows by considering the 
projector Q = {p > 2 R a} that is primal feasible for D^{p\\a) 
when R = D e s (p\\a) — £ for an arbitrary £ > 0. Furthermore, 

(a, Q) = tr a{p > 2 R a} < 2~ R tr p{p > 2 R a} < 2- R . 

Hence, D £ h (p\\cr) > R, which implies the result when £—5-0. 

To get the second inequality, consider first the case where 
Z)|(p||cr) is finite, and an operator < Q < 1 that is primal 
optimal for D e h {p\\a). Using fi = log 8 + D e h {p\\a), we find 

tr p{p > 2»a} > tr((p - 2^a){p > 2^a}) 

>to{b-W*)Q)={p,Q)-2f>{<r,Q) 
>l-e-S, (12) 

where the last inequality follows from the fact that Q is primal 
optimal. Thus, D e s +S {p\\a) > p, concluding the proof. Finally, 
in the case where (tr, Q) = 0, Eq. ( p"2| ) holds for any p and, 
thus, both sides of the inequality diverge. ■ 
Furthermore, we consider the information spectrum for the 
state £cr(p), where £ a is a pinching of p in the eigenbasis of 
a, i.e. 



£„(p)=J2 p °P p °> where p ° 



E 



\Uy)(Uy\ 



is the projector onto the eigenspace with eigenvalue s. We 
will see in the following that the entropies D e s (£ a (p)\\a) and 
Dl(p\\a) are related. Furthermore, £ a (p) and a commute. 

In order to refine this analysis and make it applicable for 
the second order expansion, we employ the probability intro- 
duced by Nussbaum and Szkola E9l . Using the eigenvalue 



decompositions p = 2~2 x r x \v x }(v x 
they defined two distributions: 



and <J = J2 V S 



P Pi „{x,y) := r x \(v x \u y )\ 2 and Q Pt<7 (x,y) := s y \(v x \u y )\ 2 . 

These distributions have the very convenient property that 
the first two moments of log P P:(T — log Q PiCr under P p a agree 



with the respective moments of log p — log a under p. Namely, 
it is easy to verify that 

D{P p ,a\\Q P: a) =D(p\\a) and s(P Pta \\Q p , a ) = s(p\\a). (13) 

Moreover, in the i.i.d. scenario, we have P p ^^ — P™ CT and 
Qp",cr" = Q n p a usm g the notation introduced previously. 

The asymptotic analysis can thus be reduced to the problem 
of finding suitable relations between the one-shot entropies and 
the quantity D e s (P p ^\\Q p ^) for general p and a. 

B. Useful Inequalities for Relative Entropies 

We obtain the following inequalities for relative entropies. 
Proposition 13: Let p G S(^K), a G V(JK), < e < 1, and 
< S < e. Then, using v — v(cr), we obtain 



D^(p\\a)>D s -%p\W)+2logS-2-\oge, (14) 

l-e 



D 



(Hk)<^(^(p)lk)+log^-log(l-e), (15) 



D s - d (£,(p)h) < DXPpAQp,*) - lo s^ (16) 
Dl{£ a {p)\\o) > Dr 5 (P P ,a\\Q P ^)+^gS-logiy, (17) 

D^(p\\a)<D%(p\\a)+logv-log(l-e), (18) 

Df^{p\W) > Dl- S (p\\a) +31o g( 5-31og3-log e . (19) 

Proof of ( |14) : Assume e (p\\a) = R and choose p 

such that p < 2 R a and F(p, p) 2 > e. Now, we consider the bi- 
nary projective measurement {{p < 2 R+S a}, {p > 2 R+S a}} 
for some 5' > 0. The monotonicity of F yields 



< \ tip{p < 2 R + S 'a}\ /tr p{p < 2 r + s '<j} 



trp{p > 2 R + s 'a}Jti p{p > 2 R+5 'a} 



< \Jtrp{p < 2 R + S 'cr} + ^tip{p > 2 r + s '<t}. (20) 
Moreover, the condition p < 2 R a implies that 

trp{p > 2 R+S 'a} < 2 R tia{ P > 2 R+S ' a} 

< 2- s ' trp{p > 2 R+S 'a} < 2~ s ' . 

Combining this with ( |20| > and choosing V2~ s ' = *Je—\[e — 5, 
we find tr p{p < 2^+^} > e - 8. Hence, 

4e 



D e ~ b {p\\a) < R + 8' < R + log 



S 2' 



where we used s/e — \/e — 8 > in the last step. ■ 
Proof of §T5§: We set R = D £ s (£ a (p)\\cr) and define Q = 
{£*(p) < 2 h cr}- Thus, Q satisfies tr(pQ) = tv(£ a (p)Q) = e, 
where we used that £ a leaves Q invariant. Now, we choose 



s.t. F(p,p) 2 > tr(pQ) =e. 



tr(pQ) 

Moreover, Lemma [TT| shows that p < is£ cr (p) and, thus, 

QpQ < v Q£ a {p)Q < v 2 R QaQ <v2 R a. 

where we used the definition of Q and that it commutes with 
a in the final two inequalites. Thus, p < a and 

D^(ph) < inf{A | p < 2 x a} < log v + R - log(l - e), 
completing the proof. ■ 



JOURNAL OF BTpX CLASS FILES, VOL. 6, NO. 1, JANUARY 2007 



10 



Proof of ( p~6] > and ( fTTj i: Since £ a (p) commutes with a, 
there exists a common eigenvector system {u y }, i.e. 

£ °(p) =^2 r 'y\ u y)i u y\ and CT = E s fKX%l- 
y v 

Using the representation p = J2 X r x\v x )(v x \, we can describe 
distributions P p a and Q p , a as follows 

P Pi(7 {x,y) = r x \(v x \u y )\ 2 and Q p , a (x, y) = s y \(v x \u y )\ 2 . 

Furthermore, we define the distribution 

Q' P Jx,y) := r' y \(v x \u y )\ 2 

and note that D £ s (£ a {p)\\a) = D e s {Q' pa \\Q p ^). We drop the 
subscripts p and a in the following. For real R and 5', we find 



?{l0E~<R} 



Q 



=^{i°4 +log ^-*} 



Q' 



> 



> P {la g ^<R-5' Alog-<^} 



= l-P{log|>P-y Vl g|>^} 

>l-P{log|>P-*'}-P{log|>/f 

= p{log|<P-4-P{log^>4 
Similarly, we bound 



(21) 



p{log|<P + /f> 



P{ log £ < R}-P{ log ^<-^} 



(22) 



Moreover, we have P{log ^ < -5'} < P{P < 5} < S, 
where we chose 5' = — log S. Hence, if we further choose 
R = D £ S {P\\Q), we have P{log g < i?} < e by definition. 
Together with d22|, this yields 



p{log|<R 



> e 



which directly implies ( |T6] >. 

Applying the Markov inequality, we obtain 

P{]og£>*}=p{£> 2 -'} 

<E^)5H 2 ^' 

= 2- 5 'tr(p%(p)- 1 ). 

Since the quantity tr(/9 2 <r _1 ) decreases under the operation of 
TP-CPMs 1 37 1, it also satisfies joint convexity. Hence, using 
the eigenvalue decomposition of p, we have 

tr{p 2 £ a {p)- 1 ) < ^^trd^X^l^d^X^I) -1 ) 

X 

<max<0|f CT (|0)(0|) _1 |0). (23) 



Furthermore, since £„ is a projective measurement of the form 
{Mi}i =1 , we may write £ a = J2i=i oti\ipi)(ipi\ for 

coefficients ctj = ((p\Mi\4>) and orthonormal vectors \ipj) — 
-7=Mi\<t>). The expression (|23|) now yields 



a, > o 



i=l 
a, > 



Finally, we thus find P{log ^ > 5'} < 2 5 ia The choices 
5 = 2~ 5 V and i? = D e - S {P\\Q) together with ([21} yield 

p{log|<P-log^}<p{log|<p} +5 < £ , 



which concludes the proof. ■ 
Proof of ( fT8j ) ant/ ( |19) : The last two inequalities follow 
from the previous relations. We have 



£>2£T(pH < £>I(M/>)IH + iog^ - io g (i - 

^^(^(pJH+loglz-logCl 

^D^plH+togiz-bgCl-e) 
Furthermore, using Eqs. ( fl4| > and (jTTJ 

^J^Cplk) > ^ (Plk) + 2 log 5 a - 2 - 
0||cr) + log£?<5 2 - 

5 



> D 



and the choice S\ = 



e—5i —<52 

f , <5 2 = | yields (19) 



e) (cf. Q5)) 
■e) (cf. (fTT)) 
(cf. <§) 



loge 

2 - loge, 



C. One-Shot Entropies and the Information Spectrum 

The above relations allow us to bound the hypothesis testing 
and smooth entropies in terms of the classical information 
spectrum of P p a and Q p . a . 

In order to refine these statements, we need the following 
notation. For a given positive semi-definite matrix a, we de- 
note the number of distinct eigenvalues of a by v(p). We also 
define the number A(cr) := logA max (cr) — log A m i n (cr), where 
Amax is the maximum and A m i n the minimum eigenvalue of 
a. Finally, we employ 

9(a) := min{2[A(cr)l, v(a)}. 

The bounds can now be stated as follows. 

Theorem 14: Let p G S(0i), a G P(JC) and < e < 1 and 
< 5 < min{e, 1 - e}. Then, using 6> = 9(a), P 
Q = Q P ,a, we have 



P p ,a and 



£>|(p||<7)<Z> s £+5 (P||Q)+log 



2 8 e<9 
<J 4 (l-e-5) : 



^(Plk) > ^r 5 (^IIQ) - log^ + log 8, 



(24) 
(25) 



D^ s (p\W) <DI+ 5 (P\\Q)+ loge -log (*(l-e)), (26) 



(27) 



Proof: We first show weaker inequalities for i/(<r) in place 
of and then argue that the inequalities still hold if we replace 
v by 2[A(cr)]. In particular, this implies that they also holds 
for the minimum of these two expression, i.e. for 9. 
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Let v = v(a), and Si > 0, i = 1 . . . 3, such that J^i $i — 
(We will optimize over these partitions later.) We find 



<D^(p\\a)-\og6 1 

< Dil^- s ^{p\W) - log Ml + log(4e) 

< D s + 5 ^(S a (p)\\cr) -\ogS.6l +log(4e) 

- log(l - e - Si - S 2 ) + log v 

< D e s +S ^ +S3 (P\\Q) -lo g 5 1( 5|(53 + log(4e) 

- log(l - e - Si - S 2 ) + log v. 



(cf. {TT}) 
(cf. gg) 

(cf. {B)) 

(cf. Q5|) 

Choosing <5i = ^ = |, = § yields ( |24"| >. Next, we have 



DUfh) > Dl(£ a {p)\\a) 

> Dl(S a (p)\\a) 

> D £ - s (P\\Q)+\og5-\ogv, 



(cf. @) 

(cf. <[TT}) 

(cf. C3> 

which constitutes Q25| l, Then, Eq. |26) follows from 

< D e s (E a (p)\\<r)+ logv-log(l-e) (cf. (fB]l) 

<£>| +5 (F||Q)-log(5 + log^-log(l-e). (cf. 

Finally we show p7) . For any <5i, ^ > such that 61+62 = 6, 

D 2^(p\\<r) 

>D^(£ a (p)\W) 

> Dl-^(£ a {p)\\<j) + 2\og5 1 - log(4e) 

> D s -^- 5 *(P\\Q)+\ogS{5 2 -\og{Asv). 



(cf. (§) 
(cf. H§) 
(cf. 



Choosing 



2,) 



j, we obtain ( |27j ). 



The above inequalities can now be adapted such that v 
is replaced by 2[~A(cr)"|. We exemplify this by proving the 
inequality corresponding to ( p4| i. However, the argument is 
analogous for all inequalities in the theorem. 

For a positive integer I to be determined later, we de- 
fine a' by the following procedure. First, we diagonalize 
a = J2 V s y\ u y)( u y\ w i tn s i > s 2 > ■ ■ • > s d an d define 
A = A(cr) = log si — log Sd- Moreover, we define = s^2t 
when logs,, G (logs d +-i, log s d +y (i+1)] fori = 0. . .1— 1 
and s' — Sd if s y — Sd- Hence, a' < a and a" 2~~ a < a'. 
Since the number of eigenvectors of a' is at most I, ([24b yields 



D%{p\\a) < Dt(p\\a') 

<Dl +s (P p ^\\Q Pia ,)+log 



2 s el 



<Dl + \P p ^\\Q p , a „)+\og 



61(1-6-5) 
2 s el 



= Dl+ s (P p , a \ 



log 



6 4 (l-e-6) 
2 s el A 

51(1-6-6) + r 



Finally, substituting |~A(er)] into I, we find j 
log2[A(cr)] and, thus, 



(cf. <(9}) 
(cf. (f2§) 

(cf. <|9)) 

(cf. <[T0f) 

+ log I < 



D%(p\\a) <Dl+ s (P P AQ P ,a) +\og 



2 8 e ■ 2fAl 

5\i-E-sy 



VI. Asymptotic Analysis 

We first investigate the behavior of the classical information 
spectrum of the Nussbaum-Szkola distributions in the asymp- 
totic limit. For this purpose, we consider the quantity 

D e s (P P AQp^) = sup{i? e K I P{Z <R}< e}, 

which is equivalent to F^ 1 (e), the inverse of the cumulative 
distribution function of Z = \ogP p (J (X) — log Q p . a (X ). 

In particular, we are interested in i.i.d. states p n — p® n and 
a' 1 — a® n . Then, the respective Nussbaum-Szkola distribu- 
tions are also of the i.i.d. form P pa .(x) = Yii Pp-<?( x i) an d, 
similarly, Q™ a (x) = Yli Qp,a(xi). It is easy to verify that the 
information spectrum evaluates to 

D s s (PpJQp,a) =nsup{R\P n {Z <R}< e}, (28) 

where Z = — %i is averaged over n i.i.d. random variables 
Zi = logP p , a — logQp !<T . Now, due to the central limit 
theorem, the distribution of 



Z — p 



y/n~ ^ , where p = E[Z] and s = ^E[(Z - p) 2 ] 

converges to the normal distribution. More precisely, the 
Berry-Esseen theorem states that 



P" 



r Z - p 

in < z 

s 



Hz) 



< 



Cr 3 



where r = {/E[\Z - p\ 3 }, C < | (381, and the cumulative 
standard Gaussian distribution is given by 



1 



/2vr 



We now evaluate the terms, using ( fT3j ). We get 

p = D(P p , a \\Q p ^) = D(p\\a), 
s = s(Pp >a \\Q p>a ) = s(p\\a) 
r = r(Pp. r7 \\Q p . a ), 



and 
where 



r(PpA\Qp,*) 



Combining this with ( [28) , we can write 

D e s (PpJQ P \) = nD(p\\a) + sfrs{p\\o) 



■ sup < X 



pl< 



fn- — - < x ) : 



and further use the Berry-Esseen theorem to bound 



D S s(Pp,JQ n p ,a) <nD(p\\a) + VTis(p\\a)^ 1 [e , ; 
D s s {Pp,JQ n P ,,) > nD(p\\a) + ^is(p\\a) ^ (e . 



Cr 3 



Cr 3 



in, 
(29) 



Finally, assume that r is finite. Then, since $ 1 is continu- 
ously differentiable, for any fixed e £ (0, 1) and 6 proportional 



JOURNAL OF BTpX CLASS FILES, VOL. 6, NO. 1, JANUARY 2007 



12 



to 1/y/n, we have the following asymptotic expansion for 
large 

Dl ±s {P;\ a \\Ql a ) = nD{p\\a) + vMHH^) + 0(1). 

(30) 

A. Asymptotic Behavior of Relative Entropies 

We first investigate the asymptotic behavior of Df l (p n \\a n ) 
and D^ lax (p n \\a n ) for large n. A straight-forward application 



of Theorem 14 yields, for < 5 < min{£, 1 — e}, 



D e - 5 (PZ*\\Q n P ,<,) ~ log 



6(a n ) 



<Dr\p;jQi a ) + \o g 



<mp n h n ) 

2 s e6(cr n ) 



<5 4 (l-£-5)' 

Now, we observe that 9(<j n ) < 2\X(a n )] = 2\n\(a)~\ scales 
at most linearly in n if X(a) is finite. Furthermore, choosing 
(5=1/ y/n, we can apply (30) to get 



Di(p n \\a n ) = nD(p\\a) + ^(pH^e) + O(logn). 

(31) 

An analogous relation is derived for _Df nax (p"||er"), where 
and the relation $ _1 (l-e 2 ) = -<J> _1 (£ 2 ): 

/ ns(p|| ( r)$- 1 (e 2 ) + 0(logn). 

(32) 



we use Theorem 



14 



Dl ax (p n \\a n )=nD(p\\a) 



B. Asymptotic Behavior of Operational Quantities 

We first treat source compression with quantum side in- 
formation. Recall Theorem [9] which provides the following 
bounds on m £ . 

For any CQ state pxB and < v < e < 1, we have 

-DKpxbUx O Pb) < max-L»|(/9xs||lx O ub) 

ob 



< m E {X\B) p < -Df^ipxBWlx ® Ps) 



2 3 e 

log—. (33) 

if 

We now consider the i.i.d. asymptotic setting with p XB and 
its marginal p B . First, note that, X(p B ) = nX(ps) is linear in 

and, thus, \og9(p r j 3 ) is of the order O(logn). Furthermore, 
the choice r\ = l/y/n ensures that the additive terms in ( [33] ) 
are of the order O(logn). 

Combined with (31) , this yields the following result. 

Corollary 15: For any CQ state pxB and any < e < 1, 
we find the following i.i.d. asymptotic expansion: 



Note that a simple application of Theorem [14] is not suffi- 
cient for deriving the i.i.d. asymptotic expansion of £ E (X\B) p 
because of the optimization concerning a B and the fact that 
we cannot bound 9{cr B ) for the optimal a B . 

Instead, we use the following relation (cf. Proposition [T3), 

DL^PxbUx^ctb) 



3 3 e 

(Pxu||lx®0-B)-Iog 



and Theorem [9] which yields 



maxU 



1-e -/x 



(P 



XB\\±X 



' <?b) 



> D 



l-£ 2 -p-8 



{Pxb\\1x ® pb) - log 



2 3 e 



Combining the above relations, we obtain 

2 3 



--Dmax^XsHlx ® Pb) - log ■ 
<t{X\B) p 



< max-L>^ ax (p XB ||l x ® cr s ) 



< max -L<, 



< — D 



1-e -ij-<5 



3 £ 

(Pxb||1jc ® crs) + log 

2 3 3 3 £ 2 



(pxbUx O Pb) + log 



(34) 



: {X n \B n ) = nH(X\B) - y^ns(X\B) $ _1 (e) + O(logra). Axed. We use s = s(X\B) p , r = r{P t 



We now consider the i.i.d. asymptotic setting with p XB and 
its marginal p B . Again, note that, X(p B ) = nX(ps) is linear 
concerning n, and, thus, \og6(p B ) is of the order O(logn). 
Furthermore, the choice rj = p = S = l/y/n ensures that the 
additive terms are of the order O(logn). 

This yields the following expansion due to (3~T) and ([32). 

Corollary 16: For any CQ state pxB and any < e < l, 
we have the following asymptotic characterization for large n: 

£ e (X n \B n ) = nH(X\B) + y^siXlB)®- 1 ^ 2 ) + O(logn). 

We employed the conditional entropy 

H{X\B) p :=-D{pxb\\Ix®Pb) 
as well as s(A\B) p := s(p AB \\ 1 A <gi p B ). 

VII. Finite Block Length Analysis 

Our results of the previous section also directly imply 
bounds for finite block lengths, i.e. computable bounds on the 
operational quantities for fixed, large n. To get such bounds, 
we simply carefully combine the results presented above, 
which yields the following. 

Theorem 17: Let pxB be a CQ state and < e < 1 be 

and 



PXB-.PB I PXB j PB . 



To analyze source compression with quantum side informa- 
tion, we start with the one-shot characterization of l e given in 
Theorem [8] For any CQ state pxB and < e < l, we have 

-D^KpxbUx ® Pb) - log ^ < £ £ (X\B) p 
< max-£>^ ax (pxB||lx ® cr B )- 



5 If / is continuously differentiable, c a constant and n > no, we may 
write v / n/(x + ~^=) = \/nf{x) + cf'(a) for some a £ [x, x+ ~t= ]- 



A = A(p B )^]Moreover, let < £ < min{£, 1 - e}. Then, for 
any n > c ' 2 r B , we have 

f r~ « i 2 9 £[nAl 
sup i( £+ g)_i og 

«0<?<l-e I (!- £ -0 



< 



inf (-^ g $-i( £ - C )+log 2233g J^ A1 1 



6 Note that, generally, r(P PxB tPB \\Q PxB , PB ) + r(p xs ||l x ® p B ), 
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Furthermore, let < £1 < minje 2 , 1 — e 2 }. Then, for any 
n > R 2 r R , we have 

5 5 e 4 \nX] 



sup 

€i<€<e 2 



/ns<f>- L (e z -£) - log 



<^(X n |B n ) p »-nff(X|B), 



Q3 
\/ns ] 



ni-e) 



< inf 

£l<£<!-£ 2 



/ns$ -1 (£ 2 +£) + log 



CV3 



The remaining optimization over £ is can be performed 
numerically. Note that any £ in the required range gives 
valid lower and upper bounds on the operational quantities. 
Moreover, the asymptotic statement can be recovered when 
choosing £, £o an d £1 proportional to 1/y/n. 

Proof: To get the first statement, we bound (f3~3"]l using 



Theorem 14 This yields 

(PpxBtPB WQpxB.PB 

< m £ {X n \B n ) pn 
<-D^(P PXB , PB \\Q 



log 



2 8 e6(p r i 



5 4 {l-e-S) 
2 3 e9(p n B ) 



Pxi 



,.) + log ; 



We further bound 9(p B ) < 2\nX\ and choose r\ = 25. The 
Berry-Esseen Theorem ( f29] > then gives us the expected bounds 
when we substitute £ in the argument of Note also that 
the parameter £ can still be optimized over. 

Similarly, bounding ( f34] > using Theorem [T4| yields 

2 3 9(p B ) 



_ ni-(£-^) 2 +<5cp I 

< £ e (X"|B n ) p n 



log 



?7 4 (5(1 



1-e' 



-fJ,-T]-6(p 



Pxb,Pb 



\Q 



Pxb,Pb 



)+log 



2 3 Z z e 2 9{p n B ) 



p 3 rj 2 S 



(35) 



The expression can be simplified using 9{p B ) < 2\nX~\. Then, 
the upper bound follows by choosing p = 3<5 and r\ = 28 and 
substituting £ as above after applying p9| ). 

The optimization leading to the lower bound is a bit more 
involved. However, it is easy to verify that the choices r\ = || 
and + lead to [e — i]) 2 — 5 = e 2 — (. Then, further 

bounding 5 > and substituting £ as above, we arrive at 
the desired bound. ■ 



VIII. Relation to Quantum Information Spectrum 

In the framework of the quantum information spectrum 
method, one treats general sequences of quantities a = 
{ a n}^=i and investigates their asymptotic behavior. Given a 
sequence of Hilbert spaces, JC, and two sequences of states, 
p and a, such that p n ,a n G S(fK n ) for all n, the quantum 
information spectrum is defined as lfl2l 

D(e\p\\a) := sup {R g E | limsup trp n {p„ < 2 nR a n } < e}, 

S(£|p1|ct) := inf {R G E I liminf trp n {p„ < 2 nfl CT„} > e} 

= sup{i? G E | liminf trp„{p„ < 2 nfl cr„} < e}. 



Our goal is to show that this can be expressed in terms of the 
entropic quantity D e s that was used in the previous sections. 
For this purpose, we need the following lemma. 

Lemma 18: Let g be a sequence of monotonically increas- 
ing functions and define f n (e) := sup{i? | g n (R) < e}. Then, 

sup { liminf f n {e n ) | lim sup e„ < e} 

= sup {R G E I limsup g„(i?) < e}, and (36) 

sup { liminf f n (e n ) liminf e„ < 

- n— >oo n— >oo 

= sup{i?GE|liminf 5 „(i?) <ej. (37) 

Proof: We prove Eq. ( |36| ) and simply note that Eq. ( f37j ) 
can be shown analogously. 

By definition of the supremum, for any 6 > 0, there exists 
a real R' satisfying lims\xp n ^ 00 g n (R') < e and 

sup {R G E | limsup g n (R) < e} < R 1 + S. 

n— >oo 

We now define a sequence e = {e n }%Li using e„ = g n (R'). 
Then, we have lim sup^^^ e„ < e and, since i?' < 
fn{g n (R')) = f n (tn) for all n by definition of /„, we find 
R! < liminf^oo f n (e n ). Hence, 

R' < sup { liminf f n (e n ) | lim sup e n < e}. 

Conversely, there exists a sequence e' satisfying satisfying 
limsup,,^ e' n < e and 

sup { liminf /„(e„) I lim sup e„ < e} < liminf f n (e' n ) + 8. 

f n^-oo n-*oo n-J-oo 

We now define R = lim inf „_j.oo /n( e n)- Then, by definition 
of the limit inferior, there exists an integer no sucn mat fi— <5 < 
/ n (0 f° r n — n o- Hence, — <5) < e' n for n > n and, 

thus, limsup,^^ 5„(i? — 5) < limsup^^^ e' n < e. Thus, 

liminf f n (c' n ) < sup {i? | limsup g„(i? — 5) < e} 
= sup {i? | limsup g n {R) < ^} + 5. 

Since we may choose <5 arbitrarily small, the above inequalities 
establish equality in d36b. ■ 



We now employ Lemma 18 using the sequence of func- 
tions g n (R) = trp n {p n < e nR cr n }, and, hence, /„(e) = 
D e s (p n \\<j n ) by definition. This yields the following equalities. 

D(e\p\\a) — sup { liminf —D t s Tl {p n \\ a n ) limsupe„ < el, 
? I n->oo n n-Hx > 



D{e\p\\S) = sup { liminf ~D e s n {p n \\a n ) 



p L n— yoo n 



lim inf e„ < e 



}• 



This shows that relative entropy D s constitutes an entropic 
version of the information spectrum D and D_. 

Together with the hierarchy derived in the previous section, 
this allows us to relate various information quantities to the 
quantum information spectrum. As an example, the following 
operational quantities are used to analyze quantum hypothesis 
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testing ifTZl . The asymptotic achievability is given by 
B(s\p\\a) :— sup { limi 



inf — log(cr„,<3„) e(Q)<e\ 
oo n J 



sup <^ R G 



1 



3Q : liminf — log(cr„, Q n ) > R A e(Q) < e\ 

n— >oo 71 J 

sup ( lim inf - D\ " (p„ 1 1 a n ) lim sup e n < e \ , 

? I n-s-oo 71 ra->oo > 



where we used e(Q) = limsup n _ j . 00 (p„, 1 — Q n ). On the other 
hand, the asymptotic converse is described by 

-1 



B^e\p\\a) 
inf ji? e IS 



sup 



i inf — log (a n ,Q r 



e(Q) < e} 



ip <. lim ii 

=; I n— 

» : liminf — log(a„, Q n )>R=^ e(Q) > e\ 

n— ¥oc n J 

sup ( lim inf - (p„ 1 1 a n ) lim inf e„ < e X , 

-* l. n— foo 71 n— >-oo J 



where we used e(Q) = lim inf n ^oo (Pn, 1 — Qn)- The equal- 
ities with the expressions involving the one-shot entropy can 
be verified by choosing e„ = (p n , 1 — Q n ) for any sequence 
Q. Conversely, for any sequence e satisfying the constraint, 
we choose Q n as the primal optimal solution for D e ^ l (p\\a). 
Using Lemma [12] we can confirm the following result fl2l : 

B(e\p\\a) = D(e\p\\a) and B\e\p\\a) = D(e\p\\a). 

Furthermore, Theorem [14] and Lemma [12] together imply that 
as long as the number of distinct eigenvalues in cr n or the 
logarithm of the minimum eigenvalue in a n grows at most 
polynomially in n, we get the following, novel, relations: 

B(e\p\\a) = £?(e|Po||Pi) and B\e\p\\a) = £?t( £ | j5 || 

where Pi is the sequence of classical distributions {Pi, p „,<r n }n 
as defined in Section [VI] The latter quantities can be bounded 
further using results from classical hypothesis testing. 

Furthermore, we want to point out that our analysis can be 
used to extend results by Datta and Renner (9) relating the 
information spectrum for e £ {0, 1} and smooth min- and 
max-entropies to arbitrary < s < 1. If the eigenvalues of 
a n satisfy the condition of the previous paragraph, then ( [18] ) 
and ( p~9] > imply the following results: 

D{e\p\\a) = sup { liminf - (p n \\a n )\ lim sup e„ < e}, 

e n->oo n n->oo 

D(e\p\\a) = sup { liminf -D^^(p n \\<T n )\ liminf e„ < e}, 

^ n— too TL n— fcso 

which can be readily specialized to conditional entropies. 

Finally, we want to point out that the sequences of rates, 
{±m^(X\B) Pn } n and {^{X\B) Pn } n , can be expressed 
asymptotically using the information spectrum method anal- 
ogously to the case of hypothesis testing. Our results then 
show that their asymptotics are equal to the asymptotics 
of {±H s h «(X\B) p J n and {±H^ n (X\B) p J n , respectively. 
Furthermore, if the abovementioned conditions on the eigen- 
values are satisfied, these expressions correspond to the in- 
formation spectrum, D and D. This is discussed in detail in 
Appendix [B] 



IX. Conclusion and Discussion 

We characterize both source compression and randomness 
extraction with quantum side information using one-shot en- 
tropies in such a way that the second order asymptotics of 
these tasks can be recovered. This result improves on previous 
characterizations of these quantities that were only shown to 
converge in the first order. 

We want to point out the relation of our result to the 
smooth entropy framework that has recently been employed to 
characterize various quantum information theoretic tasks in the 
one-shot setting. Such characterizations allow to recover the 
correct asymptotic behavior in the first order, and that is often 
taken as a sufficient reason to call them "tight". However, we 
stress that a first order analysis is independent of the required 
security or error parameter, ej^j Hence, a characterization with 
HL± is equivalent to a characterization with H^ in , H^J n 



or even H„ 



in the first order — a freedom that is often 



used extensively to prove these results. In the second order, 
however, the above quantities behave very differently. Hence, 
tightness in the second order requires a more precise analysis 
of the one-shot problem, resulting in a characterization of the 
operational quantities in terms of HL^ plus terms that grow 
at most proportional to log - when r/ — >• 0. 

It appears that such a characterization is only possible 
in terms of a carefully chosen one-shot entropy, depending 
on the task at hand. We show that the hypothesis testing 
entropy, Hf n , allows a tight one-shot characterization of 
source compression, while the smooth min-entropy, HL^ 
takes the respective role for randomness extraction when the 
secrecy criterion is given in terms of the purified distance. In 
conclusion, we do not expect that a single one-shot entropy is 
sufficient to characterize all relevant tasks such that the correct 
second order asymptotics can be recovered. 



Finally, we established in Section VIII that the behavior of 
the asymptotic information spectrum of a task — both its direct 
and converse part — can be expressed as an appropriate limit 
of the respective one-shot quantity. Hence, a thorough analysis 
of the one-shot quantity leads also to the understanding of the 
behavior of the asymptotic information spectrum. 
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Appendix A 
Example of Finite Block Length Analysis: 
Eavesdropping on Pauli Channel 

We consider the state that results when transmitting either 
|0) or |1) through the complementary channel to a Pauli 

7 In the contrary, such an analysis is expected to yield the same first order 
asymptotic expansion for all < e < 1. 
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channel with a phase error p < i that is independent of the 
bit flip error. The resulting state is 



PXB 



1 1 

-Y^m^WWl where 



i<« = v?|o> + (-i)Vr^|i). 

Morever, we note that the non-trivial part of the Nussbaum- 
Szkola distribution for this state reads 



P = P 



PXB.lxt 



Q — QpxbAj 



fp p 1 -p l-p \ 

1 2 ' 2 ' 2 ' 2 J 
(?rV,(l-p) 2 ,(l-p) 2 }. 



and 

2 ] 



We are interested in how much randomness can be extracted 
from n i.i.d. copies of this source for finite n, i.e. we want to 
find bounds on l e {X n \B n ). We first bound this in terms of 
the classical information spectrum. For any choices of £1 , £ 2 € 
(0, 1), we write using (|35]l, 



-D\-^-*XP»\\qn- log 

< l £ (X n \B n ) 

< _^(i + « 2)(pn|l0 ™ )+log m!jAi (38) 



Then, we note that D e g (P® n ||Q® n ) can be evaluated precisely 
as follows. First, we recall ((281 and write 



D e s (P n \\Q n ) = sup {i? | P"{ £ Z 4 < i?} < e], 

i 

where Zi = log P — log Q is a random variable that takes 
value log ^ with probability p and value log 2 (i- P ) w i m 
probability 1 — p. We rescale this into a Bernoulli trial 
fl< = (Z t - log (log and find 

1 s v ' 

R 

where we used that log — - is positive for p < | . Hence, 



Df(P"||Q") 



sup jp P n {Yl Bl - ft] - £ ] lo § + n lo S 



1 



max {fc € N |P(fc — 1; n,p) < e} log 



P 



n log 



2(l-p) 
1 

2(1-P)' = 



Appendix B 

The Information Spectrum of Source Compression 
and Randomness Extraction 

Here, we treat source compression and randomness extrac- 
tion using the information spectrum method. That is, we focus 
on the asymptotic operational quantities for general sequence 
of information sources pxb '■= {pxB,n}n- 

We define 

m(e\X\B\p) 

:=inf{limsup - logA^(P„)| limsupp crr (P n ,p XB ,„) < e} 

T> n— >oo ^ n—t-oc 

= inf {lim sup — m £n (X\B) Pn | limsup£ n < e} 

£ n— foo ^ n— >-oo 

m^elXlPlp) 

:= inf {lim sup - log.M(P„)| lim inf Pen(V n ,PxB,n) < e} 

V Tl-i-OO 0, Jl-i-OO 

= inf {lim sup — m £n [X\B) p | lim inf e n < e} 

S n->oo Tl " n-i-oo 

:=inf{liminf-logZ(P„)|limsup(i S ec(7 : 'r l ,PxB.rO < e} 

■p n-i-oo n n->oo 

= inf {lim inf — £ e "(A"|P) Pn | lim sup e n < e} 

S n-i-oo n " n-i-oo 

t\e\X\B\p) 

:=inf{liminf - logZ(P„)| liminf d sec (V n , pxB.n) < e} 

■p n— foo Tl n—*oo 

= inf{liminf-F"(X|P)„„|liminfe„ < e} 

e n— >oo ft n n— foo 

In order to characterize these quantities, we define the asymp- 
totic quantum conditional entropies: 

H(e\X\B\p) 

:=sup{PeR|liminftr p X B,n{pXB, n > ^ nR p B ,n} < e} 

n— too 

H{e\X\B\p) 

:=sup{Pe R|liminftrpxB,n{/Oxs,n > 2~ nR PB,n} < e}. 

n— i-oo 

Using the classical distribution, we can define another kind of 
asymptotic quantum conditional entropies: 

H c (e\X\B\p) 
:=sup{P € R| lim inf P„{P„ > 2- nR Q n } < e} 

n—¥oo 

H c (s\X\B\p) 
:=sup{P e R| lim inf P n {P n > 2~ nR Q n } < s}, 

n—^-oo 

where P„ := Pp xi3j „,p B ,„ and Q n := Qp XB>n , PBtn . 

Similarly, we can define the asymptotic smooth conditional 
min-entropy 



1 



H min (e\X\B\p) :=sup{liminf-P I e n '; n (X|P) p Jliminfe„ < s} 



where F(-;n,p) is the cumulative distribution function for 
the binomial distribution and the remaining optimization can 
be evaluated numerically. Combining this with ( |38| l, we can 
thus evaluate direct and converse bounds on the extractable 
randomness in Fig. [T] 



H min {e\X\B\p) :=sup{liminf-fl"^ n (X|fl)p n |limsupe n < e}. 



n^oo „ '™ l " r/p " „ , v 



Applying Lemma 18 to the case when / n (e) = 

^Dl- £ (p X B,n\\PB,n) and g„(R) = PXB,n{PXB,n > 
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2 nR pB,n}, we obtain 
H(s\X\B\p) 

= sup{liminf--L>^ £ "(p XB:n || j o i 3 !n )|liminfe, i < e} 



H( E \X\B\fl 
= sup{lim inf - 



Dl 



(39) 

■{pxB, n \\pB,n)\ limsup£„ < e}. 

n— »oo 

(40) 



The same type inequality holds for H c (e\X\B\p) and 
H c (e\X\B\p). 

Now, we substitute e n and i into e and 77 in Theorem pi 
,1 ....k,..; > 1 into e and <5 in Lemma 1 1 • 



and substitute e n and 
relations <[39l and d40 



m(e|X|S|p) = H(l - e\X\B\p) 
holds for < e < 1 and 

,t 



12 



m T (e|X|.B|£) = H(l - e\X\B\p) 



Due to 



(41) 



(42) 



holds for < e < 1. 

Similarly, we substitute e„ and - into e and 77 in Theorem 
H] Then, 



holds for < e < 1 and 

^el^TO = H mln (s\X\B\p) 



(43) 



(44) 



holds for < e < 1. 

In the following, we assume the above mentioned conditions 
on the eigenvalues. That is, the number of distinct eigenvalues 
in a n or the logarithm of the minimum eigenvalue in a n is 
assumed to grow at most polynomially in n. Then, combining 
relations d39l and d40|, LemmafT2l and TheoremfT4l we obtain 



H(s 2 \X\B\p) = H c (e 2 \X\B\p) = H min (e\X\B\p) (45) 
H{e 2 \X\B\p) = H c {e 2 \X\B\p) = H min (s\X\B\p). (46) 

Note that these equations hold without the above mentioned 
conditions on the eigenvalues in the commutative case. 
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